Skip to content

algisk/common

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Perform multiple mutations using FoldX

Description

This R script is used to replace specified multiple amino acids to a selected amino acid. It is useful for exploratory analysis and selecting potential proteins to be used in future experiments.

Initial purpose of this script was to select a target protein to be used as a template for a Poly-Phe protein in 2016 Vilnius iGEM project.


Dependencies

To use this R script, you need:

  • R ("data.table", "foreach", "ggplot2" packages)
  • FoldX
  • rotabase.txt (comes with FoldX)
  • target.pdb file

All of these need to be in the same folder


Usage

FoldX_template.R

Simply type Rscript --vanilla FoldX_template.R target.pdb number_of_runs omits.txt. The number of runs is an integer and omits.txt is an optional file containing a list of positions to ignore.

mutateDNA.R

Used with output .csv file from FoldX_template.R. The usage is as follows Rscript --vanilla mutateDNA.R arg1 arg2 arg3 arg4, where:

arg1 - .fasta file containing original DNA sequence
arg2 - mutation_file.csv from FoldX_template.R
arg3 - number of mutations to perform
arg4 - codon to change into (ttc for PHE)

Output

The script creates a folder called out_[PDB_name] which includes:

  • A list of performed mutations with free energy change (cumulative).
  • A plot showing energy change after each mutation compared to wild-type. alt text
  • Another plot showing local minimums. alt text
  • A folder containing FASTA files for each of these minimums.

Notes

Some pdb files simply do not work because of flaws in FoldX itself. Currently, only the first chain of target protein will be mutated. The target protein sequence must start at position 1.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages